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CLAIMS: 

1. A method of compressing a data file comprising a sequence of bytes of a length 
greater than or equal to a predefined length, the method including the steps ofi 
5 retrieving the data file from a secondary storage device; 

storing the data file in direct access memory; 

calculating the firequency of unique byte values within a sub-sequence of the 
data file, the sub-sequence having a length not exceeding the predefined length; 

creating an index for the sub-sequence, the index including a data value 
10 representing the calculated fiiequ^cy of unique byte values within the sub-sequence; 
and 

on tfie sub-sequence having a frequency of unique byte values below a 
predetermined threshold, applying a data transformation to the sub-sequence to increase 
the frequency of unique byte values in the sub-sequence and adding to the ind^ a data 
1 S value represmting fhe data transformation; 

on the sub-sequence having a frequency of unique byte values above a 
predefined threshold, adding to the index a data value rqiresenting the position of one 
or more unique values within the sub-sequence; 

a-eating an output data file, the data file having a file type identifier, and 
20 adding the index to tiie output data file. 

2. A mefiiod of compressing a data file as claimed in claim 1 wherein the step of 
applying a data transformation to the sub-sequence further includes fhe steps oft 

maintaining in compute m^ory a plurality of transformation data sets, the 
25 data sets having a sequence of byte values and identified by a transformation data set 
identifier, 

retrieving one of tiie transformation data sets from computer memory, tiie 
retrieved transfoimation data set having a length substantially equal to tiie lengtii of fhe 
sub-sequence of fhe data file; and 
30 q>plying a data transformation to respective byte values in the sub-sequence 

based on corresponding byte values in the retrieved data set. 
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3. A method of comprising a data file as claimed in claim 2 wh^in the sub- 
sequence following data transfomiation based on at least one of die retrieved 
transformation data set is substantially id^tical to the sub-sequence prior to the data 
transformation. 

5 

4. A method of comprising a data file as daimed in claim 2 wherein at least one 
of the transformation data sets comprises a randomly generated sequrace of byte rates. 

5. A method of compressing a data file as claimed in claim 2 wherein at least one 
10 of the transformation data sets comprises a predefined sequence of byte rates. 

6. A method of comprising a data file as claimed in claim 2 wherein at least one 
of the transformation data sets comprises a sequoice of byte values derived fit>m a 
portion of the data file other than the sub-sequence of the data file. . 

15 

7. A method of compressing a data file as claimed in any one of claims 2 to 6 
further comprising the step of adding to the index the transformation data set identifier 
of the data transformation data set applied to the sub-sequence. 

20 8. A method of compressing a data file as claimed in any one of the preceding 
claims furttier comprising the step of calculating the position' of the one or more unique 
values within the sub-sequence. 

9. A method of compressing a data file as claimed in clain^ 8 wherein the step of 
25 calculating the position of flie one or more unique values within the suh-sequence 
further comprises the steps ofi 

oreating a temporary position index in computer memory; 
retrieving successive byte values firom the sub-sequence; 
on retrieving each byte value, determining whether tiie retrieved byte value is a 
30 unique byte value or a repeated value; 
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on detection of a unique byte value, adding one of two bit values to fhe 
ten^>orary position index otherwise adding the ottier of die two bit vahies to the 
tCTiporary position index; 

seating a position index rq)resenting the position of the one or more unique 
5 vahies fixmi the t^porary position index; and 

calculating the data value representing the position of die one or more unique 
values at least paitiy fiom the position index. 

10. A mefliod of compressing a data file as .claimed in claim 9 wherein the number 
10 of bytes in flie sub-sequence is substantially equal to the number of bits in the temporary 
position index. 

11- A metiiod of compressing a data file as claimed in claim 9 or claim 10 wherem 
the size of the position index is less than the size of the traiporary position index. 

15 

12. A method of compressing a data file as clauned in any one of daims 9 to 11 
furtiier comprising the steps of: 

oreating a permutation index representing the ordo: of unique byte values 
within the sub-sequence; and 
20 calculating the data value re|)resenting the position of the one or more unique 

values both fiom the position index and the permutation index. 
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13. A method of compressing a data file as claimed in claim 12 comprising the step 
of concatenating the position index and the pamutation index to form the data value 
representing the position of the one or more unique values. 



